PAC Algorithms for the Infinitely-Many Armed Problem with Multiple Pools
نویسندگان
چکیده
We consider a multi-pool version of the infinitely-many armed bandit problem, where a learning agent is faced with several large pools of items, and interested in finding the best item overall. At each time step the agent chooses a pool, and obtains a random item whose value is precisely revealed. The obtained values within each pool are assumed to be i.i.d., with an unknown probability distribution that generally differs among the pools. Under the PAC framework, we provide lower bounds on the sample complexity of any ( , δ)-correct algorithm, and propose an algorithm that attains this bound up to logarithmic factors. We compare the performance of this multi-pool algorithm to the variant in which the pools are not distinguishable by the agent and are chosen randomly at each stage. Interestingly, when the supremal values of the pools happen to be similar, the latter approach may provide better performance.
منابع مشابه
Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence
We consider the problem of near-optimal arm identification in the fixed confidence setting of the infinitely armed bandit problem when nothing is known about the arm reservoir distribution. We (1) introduce a PAC-like framework within which to derive and cast results; (2) derive a sample complexity lower bound for near-optimal arm identification; (3) propose an algorithm that identifies a nearl...
متن کاملInfinitely Many Solutions for a Steklov Problem Involving the p(x)-Laplacian Operator
By using variational methods and critical point theory for smooth functionals defined on a reflexive Banach space, we establish the existence of infinitely many weak solutions for a Steklov problem involving the p(x)-Laplacian depending on two parameters. We also give some corollaries and applicable examples to illustrate the obtained result../files/site1/files/42/4Abstract.pdf
متن کاملA VARIATIONAL APPROACH TO THE EXISTENCE OF INFINITELY MANY SOLUTIONS FOR DIFFERENCE EQUATIONS
The existence of infinitely many solutions for an anisotropic discrete non-linear problem with variable exponent according to p(k)–Laplacian operator with Dirichlet boundary value condition, under appropriate behaviors of the non-linear term, is investigated. The technical approach is based on a local minimum theorem for differentiable functionals due to Ricceri. We point out a theorem as a spe...
متن کاملExistence of multiple solutions for Sturm-Liouville boundary value problems
In this paper, based on variational methods and critical point theory, we guarantee the existence of infinitely many classical solutions for a two-point boundary value problem with fourth-order Sturm-Liouville equation; Some recent results are improved and by presenting one example, we ensure the applicability of our results.
متن کاملPAC Subset Selection in Stochastic Multi-armed Bandits
We consider the problem of selecting, from among the arms of a stochastic n-armed bandit, a subset of size m of those arms with the highest expected rewards, based on efficiently sampling the arms. This “subset selection” problem finds application in a variety of areas. In the authors’ previous work (Kalyanakrishnan & Stone, 2010), this problem is framed under a PAC setting (denoted “Explore-m”...
متن کامل